Search CORE

47 research outputs found

Laplacian-Steered Neural Style Transfer

Author: Alexei
Gatys L. A.
Johnson Justin
Kingma Diederik
Lee J. H.
Zhang Hanwang
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 30/07/2017
Field of study

Neural Style Transfer based on Convolutional Neural Networks (CNN) aims to synthesize a new image that retains the high-level structure of a content image, rendered in the low-level texture of a style image. This is achieved by constraining the new image to have high-level CNN features similar to the content image, and lower-level CNN features similar to the style image. However in the traditional optimization objective, low-level features of the content image are absent, and the low-level features of the style image dominate the low-level detail structures of the new image. Hence in the synthesized image, many details of the content image are lost, and a lot of inconsistent and unpleasing artifacts appear. As a remedy, we propose to steer image synthesis with a novel loss function: the Laplacian loss. The Laplacian matrix ("Laplacian" in short), produced by a Laplacian operator, is widely used in computer vision to detect edges and contours. The Laplacian loss measures the difference of the Laplacians, and correspondingly the difference of the detail structures, between the content image and a new image. It is flexible and compatible with the traditional style transfer constraints. By incorporating the Laplacian loss, we obtain a new optimization objective for neural style transfer named Lapstyle. Minimizing this objective will produce a stylized image that better preserves the detail structures of the content image and eliminates the artifacts. Experiments show that Lapstyle produces more appealing stylized images with less artifacts, without compromising their "stylishness".Comment: Accepted by the ACM Multimedia Conference (MM) 2017. 9 pages, 65 figure

arXiv.org e-Print Archive

Crossref

ScholarBank@NUS

Structural Material Property Tailoring Using Deep Neural Networks

Author: Bergerhoff G.
Erhan D.
Gatys L.
Goodfellow I
Kingma D.
Larsen A.B.L
LeCun Y
Li V.C.
Lütjering G.
Report of the AdHoc Committee on Air Force Aircraft Jet Engine Manufacturing and Production Processes
Sieniawsk
Simonyan K.
van der Maaten L.J.P.
Publication venue
Publication date: 29/01/2019
Field of study

Advances in robotics, artificial intelligence, and machine learning are ushering in a new age of automation, as machines match or outperform human performance. Machine intelligence can enable businesses to improve performance by reducing errors, improving sensitivity, quality and speed, and in some cases achieving outcomes that go beyond current resource capabilities. Relevant applications include new product architecture design, rapid material characterization, and life-cycle management tied with a digital strategy that will enable efficient development of products from cradle to grave. In addition, there are also challenges to overcome that must be addressed through a major, sustained research effort that is based solidly on both inferential and computational principles applied to design tailoring of functionally optimized structures. Current applications of structural materials in the aerospace industry demand the highest quality control of material microstructure, especially for advanced rotational turbomachinery in aircraft engines in order to have the best tailored material property. In this paper, deep convolutional neural networks were developed to accurately predict processing-structure-property relations from materials microstructures images, surpassing current best practices and modeling efforts. The models automatically learn critical features, without the need for manual specification and/or subjective and expensive image analysis. Further, in combination with generative deep learning models, a framework is proposed to enable rapid material design space exploration and property identification and optimization. The implementation must take account of real-time decision cycles and the trade-offs between speed and accuracy

arXiv.org e-Print Archive

Crossref

The Computer-Based Generation of Fonts in the Style of Kandinsky

Author: Gatys L. A.
Publication venue: 'MIT Press - Journals'
Publication date
Field of study

Crossref

Texture synthesis and the controlled generation of natural stimuli using convolutional neural networks

Author: Bethge M.
Ecker A.
Gatys L.
Publication venue
Publication date: 01/09/2015
Field of study

It is a long standing question how biological systems transform visual inputs to robustly infer high level visual information. Research in the last decades has established that much of the underlying computations take place in a hierarchical fashion along the ventral visual pathway. However, the exact processing stages along this hierarchy are difficult to characterise. Here we present a method to generate stimuli that will allow a principled description of the processing stages along the ventral stream. We introduce a new parametric texture model based on the powerful feature spaces of convolutional neural networks optimised for object recognition. We show that constraining spatial summary statistic on feature maps suffices to synthesise high quality natural textures. Moreover we establish that our texture representations continuously disentangle high level visual information and demonstrate that the hierarchical parameterisation of the texture model naturally enables us to generate novel types of stimuli for systematically probing mid-level vision

MPG.PuRe

Texture and art with deep neural networks

Author: Bethge M.
Ecker A.
Gatys L.
Publication venue: 'Elsevier BV'
Publication date: 01/10/2017
Field of study

Although the study of biological vision and computer vision attempt to understand powerful visual information processing from different angles, they have a long history of informing each other. Recent advances in texture synthesis that were motivated by visual neuroscience have led to a substantial advance in image synthesis and manipulation in computer vision using convolutional neural networks (CNNs). Here, we review these recent advances and discuss how they can in turn inspire new research in visual perception and computational neuroscience

MPG.PuRe

A Neural Algorithm of Artistic Style

Author: Bethge M.
Ecker A.
Gatys L.
Publication venue: 'Association for Research in Vision and Ophthalmology (ARVO)'
Publication date: 01/08/2016
Field of study

In fine art, especially painting, humans have mastered the skill to create unique visual experiences by composing a complex interplay between the content and style of an image. The algorithmic basis of this process is unknown and there exists no artificial system with similar capabilities. Recently, a class of biologically inspired vision models called Deep Neural Networks have demonstrated near-human performance in complex visual tasks such as object and face recognition. Here we introduce an artificial system based on a Deep Neural Network that creates artistic images of high perceptual quality. The system can separate and recombine the content and style of arbitrary images, providing a neural algorithm for the creation of artistic images. In light of recent studies using fMRI and electrophysiology that have shown striking similarities between performance-optimised artificial neural networks and biological vision, our work offers a path towards an algorithmic understanding of how humans create and perceive artistic imagery. The algorithm introduces a novel class of stimuli that could be used to test specific computational hypotheses about the perceptual processing of artistic style

MPG.PuRe

Texture Synthesis Using Convolutional Neural Networks

Author: Bethge M.
Ecker A.
Gatys L.
Publication venue
Publication date: 01/01/2016
Field of study

Here we introduce a new model of natural textures based on the feature spaces of convolutional neural networks optimised for object recognition. Samples from the model are of high perceptual quality demonstrating the generative power of neural networks trained in a purely discriminative fashion. Within the model, textures are represented by the correlations between feature maps in several layers of the network. We show that across layers the texture representations increasingly capture the statistical properties of natural images while making object information more and more explicit. The model provides a new tool to generate stimuli for neuroscience and might offer insights into the deep representations learned by convolutional neural networks

MPG.PuRe

Image Style Transfer Using Convolutional Neural Networks

Author: Bethge M.
Ecker A.
Gatys L.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/06/2016
Field of study

Rendering the semantic content of an image in different styles is a difficult image processing task. Arguably, a major limiting factor for previous approaches has been the lack of image representations that explicitly represent semantic information and, thus, allow to separate image content from style. Here we use image representations derived from Convolutional Neural Networks optimised for object recognition, which make high level image information explicit. We introduce A Neural Algorithm of Artistic Style that can separate and recombine the image content and style of natural images. The algorithm allows us to produce new images of high perceptual quality that combine the content of an arbitrary photograph with the appearance of numerous well-known artworks. Our results provide new insights into the deep image representations learned by Convolutional Neural Networks and demonstrate their potential for high level image synthesis and manipulation

Crossref

MPG.PuRe

Understanding Low- and High-Level Contributions to Fixation Prediction

Author: Bethge M.
Gatys L.
Kümmerer M.
Wallis T.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/10/2017
Field of study

Understanding where people look in images is an important problem in computer vision. Despite significant research, it remains unclear to what extent human fixations can be predicted by low-level (contrast) compared to highlevel (presence of objects) image features. Here we address this problem by introducing two novel models that use different feature spaces but the same readout architecture. The first model predicts human fixations based on deep neural network features trained on object recognition. This model sets a new state-of-the art in fixation prediction by achieving top performance in area under the curve metrics on the MIT300 hold-out benchmark (AUC = 88, sAUC = 77, NSS = 2.34). The second model uses purely low-level (isotropic contrast) features. This model achieves better performance than all models not using features pretrained on object recognition, making it a strong baseline to assess the utility of high-level features. We then evaluate and visualize which fixations are better explained by lowlevel compared to high-level image features. Surprisingly we find that a substantial proportion of fixations are better explained by the simple low-level model than the stateof- the-art model. Comparing different features within the same powerful readout architecture allows us to better understand the relevance of low- versus high-level features in predicting fixation locations, while simultaneously achieving state-of-the-art saliency prediction

MPG.PuRe

Synaptic unreliability facilitates information transmission in balanced cortical populations

Author: Bethge M.
Ecker A.
Gatys L.
Tchumatchenko T.
Publication venue
Publication date: 01/10/2014
Field of study

Cortical neurons fire in a highly irregular manner, suggesting that their input is tightly balanced and changes in presynaptic firing rate are encoded primarily in the variance of the postsynaptic currents. Here we show that such balance has a surprising effect on information transmission: Synaptic unreliability which is ubiquitous in cortex and usually thought to impair neural communication actually increases the information rate. We show that the beneficial effect of noise is based on a very general mechanism which contrary to stochastic resonance does not rely on a threshold nonlinearity

MPG.PuRe